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Abstract. We consider the problem of representing, in a compressed format, a bit- 
vector S of m bits with n Is, supporting the following operations, where b G {0, 1}: 

• rank6(S', i) returns the number of occurrences of bit b in the prefix S 

• select;, (S, i) returns the position of the ith occurrence of bit b in S. 

Such a data structure is called fully indexable dictionary ( fid ) [Raman, Raman, and Rao, 
2007], and is at least as powerful as predecessor data structures. Viewing 5 as a set 
X — {x\,X2, ■ ■ ■ ,Xn} of n distinct integers drawn from a universe [m] = {1, . . . , m}, the 
predecessor of integer y £ [m] in X is given by selecti(S', ranki(S', y — 1)). fids have 
many applications in succinct and compressed data structures, as they are often involved 
in the construction of succinct representation for a variety of abstract data types. 

Our focus is on space-efficient fids on the RAM model with word size 0(lgm) and 
constant time for all operations, so that the time cost is independent of the input size. 

Given the bitstring 5* to be encoded, having length m and containing n ones, the 
minimal amount of information that needs to be stored is B{n,m) = [log('^)]. The 
state of the art in building a fid for S is given in [Patra§cu, 2008] using B{m, n) -f 
0(m/((log m/t)*)) -f 0(m^''*) bits, to support the operations in 0(t) time. 

Here, we propose a parametric data structure exhibiting a time/space trade-off such 
that, for any real constants < S < 1/2, < e < 1, and integer s > 0, it uses 

B{n,m) + o(n^+' + n(^^y^ 

bits and performs all the operations in time 0{sS^^ -\- e~^). The improvement is twofold: 
our redundancy can be lowered parametrically and, fixing s — 0(1), we get a constant- 
time FID whose space is B{n,m) + 0(m^/poly(n)) bits, for sufficiently large m. This is a 
significant improvement compared to the previous bounds for the general case. 

'Center for Massive Data Algorithmics, a center of the Danish National Research Foundation 
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1. Introduction 

Data structures for dictionaries O [271 [SI EZ] , text indexing [5l [13 [22l IH [3ll [32] , and 
representing semi-structured data [HI [HI [TSj [301 [SZ] , often require the very space-efficient 
representation of a bivector 5 of m bits with n Is (and m — n Os). Since there are (^) 
possible choices of n Is out of the m bits in S, a simple information-theoretic argument 
shows that we need at least B{n, m) = [log (™)] bits of space, in the worst case, to store S 
in some compressed format. However, for the aforementioned applications, it is not enough 
just to store the compressed S, as one would like to support the following operations on S, 
for b G {0,1}: 

• rankfc(S', i) returns the number of occurrences of bit b in the prefix S 

• selectf,(S', z) returns the position of the ith occurrence of bit h in S. 

Our focus will be on space-efficient data structures that support these operations ef- 
ficiently, on the RAM model with word size 0(logm). The resulting data structure is 
called a fully indexable dictionary (fid) [37] and is quite powerful. For example, S can 
equally represent a set X = {xi,X2, . . . of n distinct integers drawn from a universe 
[m] = {1, . . . , m}, where S [xi\ = 1, for 1 < i < n, while the remaining m — n bits of S are 
Os. In this context, the classical problem of finding the predecessor in X of a given integer 
y ^ V^A the greatest lower bound of y in X) can be solved with two fid queries on S by 
select]^(S', ranki(S', y — 1)). FiDs have also connections with coding theory, since they rep- 
resent a sort of locally decodable source code for S" [1] . They are at the heart of compressed 
text indexing since they enable space to be squeezed down to the high-order entropy when 
properly employed [20]. Finally, they are the building blocks for many complex low space 
data structures [21 [9l [281 [29] that require 0(1) lookup time, namely, their time complexity 
is independent of the number of entries stored at the expense of using some extra space. 

To support the rank and select operations in 0{t) time, for some parameter t, it 
appears to be necessary to use additional space, beyond the bound B{n, m) needed for rep- 
resenting the bitstring S in compressed format. This extra space is termed the redundancy 
R{n, m, t) of the data structure, and gives a total of B{n, m) + R{n, m, t) bits [13]. Although 
the leading term B{n, m) is optimal from the information-theoretic point of view, a discrep- 
ancy between theory and practice emerges when implementing fids for various applications 
[IHl [211 EHl [33l [39]. In particular, the term B{n, m) is often of the same order as, if not 
superseded by, the redundancy term R{n,m,t). For example, consider a constant-time fid 
storing n = o(m/polylog(m)) integers from the universe [m]: here, B{n,m) is negligible 
when compared to the best known bound of R{n,m, 1) = 0(m/polylog(m)) |35| . 

Our goal is that of reducing the redundancy R{n, m, t) for the general case n < m. 
Although most of the previous work has generally focussed on the case t = 0(1), and 
m = n ■ polylog(n), the burgeoning range of applications (and their complexity) warrant a 
much more thorough study of the function R(n,m,t). 

There are some inherent limitations on how small can the redundancy R{n, m, t) be, 
since FiDs are connected to data structures for the predecessor problem, and we can in- 
herit the predecessor lower bounds regarding several time/space tradeoffs. The connection 
between fids and the predecessor problem is well known [H [23l [36l [37] and is further de- 
veloped in this paper, going beyond the simple inheritance of lower bounds. A predecessor 
data structure which gives access to the underlying data set is, informally, a way to support 
half the operations natively: either select]^ and ranki, or selectg and ranko. In fact, we 
show that a data structure solving the predecessor problem can be turned into a fid and 
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can also be made to store the data set using B{n, m) + 0{n) bits, under certain assumptions 
over the data structure. 

Consequently, if we wish to understand the limitations in reducing the redundancy 
R{n, m, t) of the space bounds for fids, we must briefly survey the state of the art for the 
lower bounds involving the predecessor problem. The work in [36] shows a number of lower 
bounds and matching upper bounds for the predecessor problem, using data structures 
occupying at least Q{n) words, from which we obtain, for example, that R{n, m, 1) can be 
o{n) only when n = polylog(m) (a degenerate case) or m = npolylog(n). For m = n^^^\ 
the lower bound for B{n,m) + R{n,m, 1) is ^}{n^~^^) for any fixed constant 6 > 0. Note 
that in the latter case, B{n,m) = 0(n log m) = o{R{n,m, 1)), so the "redundancy" is larger 
than B{n, m). Since ranki is at least as hard as the predecessor problem, as noted in p^[36]. 
then all fids suffer from the same limitations. (It is obvious that ranko and ranki have the 
same complexity, as ranko(5, i) + raiiki(S', i) = i.) As noted in [371 Lemma 7.3], selectg 
is also at least as hard as the predecessor problem. Other lower bounds on the redundancy 
were given for "systematic" encodings of S (see [131 [HI [26] and related papers), but they 
are not relevant here since our focus is on "non-systematic" encodings [n\ [T8] , which have 
provably lower redundancy. (In "non-systematic" encodings one can store S in compressed 
format.) 

In terms of upper bounds for R{n,m,t), a number are known, of which we only enu- 
merate the most relevant here. For systematic structures, an optimal upper bound is given 
by [Ej for R{n, m, 0(1)) = 0{m log log m/ log m). Otherwise, a very recent upper bound in 
[35j gives R{n,m,t) = 0(m/((logm)/t)* -|- m^/^polylog(m)) for any constant t > 0. These 
bounds are most interesting when m = n-polylog(n). As noted earlier, sets that are sparser 
are worthy of closer study. For such sets, one cannot have best of two worlds: one would 
either have to look to support queries in non-constant time but smaller space, or give up 
on attaining i?(n, m, 1) = o{B{n,m)) for constant-time operations. 

The main role of generic case fids is expressed when they take part in more structured 
data structures (e.g. succinct trees) where there is no prior knowledge of the relationship 
between n and m. Our main contribution goes along this path, striving for constant-time 
operations. Namely, we devise a constant-time fid having redundancy R{n,m,0{l)) = 
0{n^~^^ + n{m/n^Y), for any fixed constants 6 < 1/2, e < 1 and s > (Theorem 13. ip . 
The running time of the operations is always 0(1) for selectj^ (which is insensitive to 
time-space tradeoffs) and is 0{e~^ + s6~^) = 0(1) for the remaining operations. When m 
is sufficiently large, our constant-time fid uses just B{n,m) + 0{m^ /poly{n)) bits, which 
is a significant improvement compared to the previous bounds for the general case, as we 
move from a redundancy of kind 0(m/polylog(m)) to a one of kind O(m^), by proving for 
the first time that polynomial reduction in space is possible. 

Moreover, when instantiated in a polynomial universe case (when m = Q{n^^^^), for 
a sufficiently small e, the redundancy is dominated by n^+^, thus extending the known 
predecessor search data structure with all four fid operations without using a second copy 
of the data. Otherwise, the term is dominant when the universe is super polynomial, 
e.g. when m = 0(2^°^""') for c > 1. In such cases we may not match the lower bounds 
for predecessor search; however, this is the price for a solution which is agnostic of m, n 
relationship. 

We base our findings on the Elias-Fano encoding scheme [3 [8], which gives the basis 
for FIDS naturally supporting select]^ in 0(1) time. 
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2. Elias-Fano Revisited 

We review how the EHas-Fano scheme [TJ [51 [33l [39] works for an arbitrary set X = 
{xi < ■ ■ ■ < Xn} of n integers chosen from a universe [m]. RecaU that X is equivalent 
to its characteristic function mapped to a bitstring S of length m, so that S [xi] = 1 for 
1 < i < n while the remaining m — n bits of S are Os. Based on the Elias-Fano encoding, we 
will describe the main ideas behind our new implementation of fully indexable dictionaries 
(fids). We also assume that n < m/2 — otherwise we build a FID on the complement set of 
X (and still provide the same functionalities), which improves space consumption although 
it does not guarantee select^ in 0(1) time. 

Elias-Fano encoding. Let us arrange the integers of X as a sorted sequence of consecutive 
words of log m bits each. Consider the firs10 [log n] bits of each integer Xj, called hi, where 
1 < i < n. We say that any two integers and xj belong to the same superblock if hi = hj. 

The sequence hi <h2 < ■ ■ ■ <hn can be stored as a bitvector H in 3n bits, instead of 
using the standard nflogn] bits. It is the classical unary representation, in which an integer 
X > is represented with x Os followed by a 1. Namely, the values /ii, /12 — /ii, . . . , /i„ — hn-i 
are stored in unary as a multiset. For example, the sequence hi, /12, /13, /14, /is = 1, 1, 2, 3, 3 
is stored as H = 01101011, where the ith \ va. H corresponds to hi, and the number of 
Os from the beginning of H up to the ith 1 gives hi itself. The remaining portion of the 
original sequence, that is, the last logm — [logn] bits in Xi that are not in hi, are stored 
as the rth entry of a simple array L. Hence, we can reconstruct Xi as the concatenation of 
hi and L [i], for \ < i < n. The total space used by H is at most 2!^^°^"^ + n < 3n bits and 
that used by L is n x (logm — [logn]) < nlog(m/n) bits. 

Interestingly, the plain storage of the bits in L is related to the information-theoretic 
minimum, namely, nlog(m/n) < B{n,m) bits, since for n < m/2, B{n,m) ~ nlog(m/n) + 
1.44 n by means of Stirling approximation. In other words, the simple way of representing 
the integers in X using Elias-Fano encoding requires at most nlog(m/n) + 3n bits, which is 
nearly 1.56 n away from the theoretical lower bound B{n, m). If we employ a constant-time 
FID to store H, Elias-Fano encoding uses a total of B{n,m) + 1.56 n + o(n) bits. 

Rank and select operations vs predecessor search. Using the available machinery — 
the FID on H and the plain array L — we can perform selecti(i) on X in 0(1) time: we first 
recover hi = selecti(i/, i) —i and then concatenate it to the fixed-length L [i] to obtain Xi 
in 0(1) time [22j. As for rank and selecto, we point out that they are intimately related 
to the predecessor search, as we show below (the converse has already been pointed out in 
the Introduction). 

Answering ranki(/c) in X is equivalent to finding the predecessor Xi of k in X, since 
ranki(A:) = i when Xi is the predecessor of k. Note that ranko(/c) = k — ranki(A;), so 
performing this operation also amounts to finding the predecessor. As for selectQ(i) in 
X, let X = [m] \ X = {vi,V2, ■ ■ ■ ,Vm-n} be the complement of X, where Vi < Wj+i, for 
1 < i < m — n. Given any 1 < i < m — n, our goal is to find selectQ(i) = Vi in constant 
time, thus motivating that our assumption n < m/2 is w.l.o.g.: whenever n < m/2, we 
store the complement set of X and swap the zero- and one-related operations. 

The key observation comes from the fact that we can associate each xi with a new value 
yi = \{vj S X such that Vj < which is the number of elements in X that precede xi, 

"'^Here we use Elias' original choice of ceiling and floors, thus our bounds slightly differ from the adarray 
structure of [33], where they obtain n[log(m/n)] + 2n. 
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where 1 < I < n. The relation among the two quantities is simple, namely, yi = xi — l, as we 
know that exactly / — 1 elements of X precede xi and so the remaining elements that precede 
xi must originate from X. Since we will often refer to it, we call the set Y = {yi,y2, ■ ■ ■ , J/n} 
the dual representation of the set X. 

Returning to the main problem of answering selectQ(i) in X, our first step is to find 
the predecessor yj of i in Y, namely, the largest index j such that yj < i. As a result, we 
infer that Xj is the predecessor of the unknown Vi (which will be our answer) in the set X. 
We now have all the ingredients to deduce the value of Vi. Specifically, the y^th element of 
X occurs before xj in the universe, and there is a nonempty run of elements of X up to and 
including position xj , followed by i — yj elements of X up to and including (the unknown) 
Vi- Hence, Vi = Xj + i — yj and, since yj = Xj — j, we return Vi = Xj + i — Xj+j = i + j. 
(An alternative way to sec Vi = i + j is that xi,X2, ■ ■ ■ ,Xj are the only elements of X to the 
left of the unknown Vi.) We have thus proved the following. 

Lemma 2.1. Using the Elias-Fano encoding, the select;^ operation takes constant time, 
while the rank and selectg operations can he reduced in constant time to predecessor search 
in the sets X and Y , respectively. 

The following theorem implies that we can use both lower and upper bounds of the 
predecessor problem to obtain a FID, and vice versa. Below, we call a data structure storing 
X set-preserving if it stores X\,...,Xn verbatim in a contiguous set of memory cells. 

Theorem 2.2. For a given set X of n integers over the universe \m], let FlD(t, s) he a FID 
that takes t time and s hits of space to support rank and select. Also, let PRED(t, s) he a 
static data structure that takes t time and s bits of space to support predecessor queries on 
X, inhere the integers in X are stored in sorted order using nlogm < s bits. Then, 

(1) given a FlD{t,s), we can obtain a PRED(0(t), s); 

(2) given a set-preserving PRED(t, s), we can obtain a FiD(0(i),s — nlogn + 0{n)) 
(equivalently, R{n,m,t) = s — nlogm -\- 0{n)) with constant-time select j^. 

(3) if there exists a non set-preserving PRED(t, s), we can obtain a FlD(0(t), 2s + 0(n)) 
with constant-time select^. 

Proof (sketch). The first statement easily follows by observing that the predecessor of k in 
X is returned in 0(1) time by select;^(S', ranki(5', — 1)), where S is the characteristic 
bitstring of X. Focusing on the second statement, it suffices to encode X using the Elias 
Fano encoding, achieving space s — nlogn + 0{n). 

To further support selectg, we exploit the properties of Y and X. Namely, there exists 
a maximal subset X' C X so that its dual representation Y' is strictly increasing, thus being 
searchable by a predecessor data structure. Hence we split X into X' and the remaining 
subsequence X" and produce two Elias-Fano encodings which can be easily combined by 
means of an extra 0(n) bits fid in order to perform select^, ranki and ranko. selectg 
can be supported by exploiting the set preserviness of the data structure, thus building only 
the extra data structure to search Y' and not storing Y'. When data structures are not 
set-preserving, we simply replicate the data and store Y', thus giving a justification to the 
0() factor. 
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3. Basic Components and Main Result 

We now address and solve two questions, which are fundamental to attain a 0(i)-time 
FID with B{n, m) +R{n, m, t) bits of storage using Lemma [2. II and Theorem 12 .21 (1) how to 
devise an efficient index data structure that can implement predecessor search using Elias- 
Fano representation with tunable time-space tradeoff, and (2) how to keep its redundancy 
R{n, m, t) small. 

Before answering the above questions, we give an overview of the two basic tools that 
are adopted in our construction (the string B-tree [lOj and a modified van Emde Boas 
tree [Ml EH]). We next develop our major ideas that, combined with these tools, achieve 
the desired time-space tradeoff, proving our main result. 

Theorem 3.1. Let s > be an integer and let < e,5 < 1 be reals. For any bitstring S, 
\S\ = m, having cardinality n, there exists a fully indexable dictionary solving all operations 
in time 0{s6~^ + £~^) using B{n, m) + 0{n^^^ + n{m/n^Y) bits of space. 

Modified van Emde Boas trees. Patra§cu and Thorup [36j have given some matching 
upper and lower bounds for the predecessor problem. The discussion hereafter regards the 
second branch of their bound: as a candidate bound they involve the equation (with our 
terminolgy and assuming our word RAM model) t = log(log(m/n)/ log(2:/n)), where t is 
our desired time bound and z is the space in bits. By reversing the equation and setting 
e = 2~*, we obtain z = Q{n{m/nY) bits. As mentioned in [36j, the tradeoff is tight for 
a polynomial universe m = n"' , for 7 > 1, so the above redundancy cannot be lower than 
6(n^+^) for any fixed 6 > 0. 

They also describe a variation of van Emde Boas (veb) trees [38] matching the bound 
for polynomial universes, namely producing a data structure supporting predecessor search 
that takes 0(log \°g^™/^-)^ )) time occupying 0(2; log m) bits. In other words, for constant- 
time queries, we should have log(m/n) ~ log(z/n), which implies that the space is 2: = 
Q{n{m/nY). They target the use of their data structure for polynomial universes, since 
for different cases they build different data structures. However, the construction makes 
no assumption on the above relation and we can extend the result to arbitrary values of 
m. By Theorem 12.21 we can derive a constant-time fid with redundancy R{n,m,0{l)) = 
0{n{m/nY). 

Corollary 3.2. Using a modified veb tree, we can implement a fid that uses B(n,m) + 
0{n{m/nY) bits of space, and supports all operations in 0(log(l/e)J time, for any constant 
e > 0. 

The above corollary implies that we can obtain a first polynomial reduction by a 
straightforward application of existing results. However, we will show that we can do 
better for sufficiently large m, and effectively reduce the term n{m/nY to n^~^^ + n(m / n'^Y . 
The rest of the paper is devoted to this goal. 

String B-Tree: blind search for the integers. We introduce a variant of string B-tree 
to support predecessor search in a set of integers. Given a set of integers X = {xi, . . . , Xp} 
from the universe [it], we want obtain a space-efRcient representation of X that supports 
predecessor queries efhciently. We develop the following structure: 

Lemma 3.3. Given a set X of p integers from the universe [v\, there exists a representa- 
tion that uses extra 0(j)loglogu) bits apart from storing the elements of X, that supports 
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predecessor queries on X in O (log p/ log log n) time. The algorithm requires access to a 
precomputed table of size 0{u^) hits, for some positive constant 7 < 1, which can he shared 
among all instances of the structure with the same universe size. 

Proof. The structure is essentially a succinct version of string B-tree on the elements of X 
interpreted as binary strings of length logu, with branching factor h = 0{\J\ogu). Thus, it 
is enough to describe how to support predecessor queries in a set of h elements in constant 
time, and the query time follows, as the height of the tree is O (log p/ log log n). Given a set 
xi, X2, • • • , Xf, of integers from [u] that need to be stored at a node of the string B-tree, we 
construct a compact trie (Patricia trie) over these integers (interpreted as binary strings of 
length logii), having b leaves and 6 — 1 internal nodes. The leaves disposition follows the 
sorting order of X. Each internal node is associated with a skip value, indicating the string 
depth at which the LCP with previous string ends. Canonically, left-pointing edges are 
labeled with a and right-pointing with a 1. Apart from storing the keys in sorted order, 
it is enough to store the tree structure and the skip values of the edges. This information 
can be represented using 0(61oglogu) bits, as each skip value is at most logu and the trie 
is represented in 0{h) bits. 

Given an element y G [n], the search for the predecessor of y proceeds in two stages. 
In the first stage, we simply follow the compact trie matching the appropriate bits of y to 
find a leaf v. Let Xi be the element associated with leaf v. One can show that Xi is the 
key that shares the longest common prefix with y among all the keys in X. In the second 
stage, we compare y with Xi to find the longest common prefix of y and Xi (which is either 
the leftmost or rightmost leaf of the internal node at which the search ends). By following 
the path in the compact trie governed by this longest common prefix, one can find the 
predecessor of y in X. We refer the reader to [10] for more details and the correctness of the 
search algorithm. The first stage of the search does not need to look at any of the elements 
associated with the leaves. Thus this step can be performed using a precomputed table of 
size 0{u^) bits, for some positive constant 7 < 1 (by dividing the binary representation 
of y into chunks of size smaller than ^\ogu bits each). In the second stage, finding the 
longest common prefix of y and Xi can be done using bitwise operations. We again use the 
precomputed table to follow the path governed by the longest common prefix, to find the 
predecessor of y. ■ 

4. Main Ideas for Achieving Polynomial Redundancy 

In this section, we give a full explanation of the main result. Theorem 13.11 We first 
give an overview, and then detail the multiranking problem by illustrating remaining details 
involving the construction of our data structure. 

4.1. Overview of our recursive dictionary 

We consider the ranki operation only, leaving the effective development of the details 
to the next sections. A widely used approach to the fid problem (e.g. see [25l [27]) lies in 
splitting the universe [m] into different chunks and operating independently in each chunk, 
storing the rank at the beginning of the block. Queries are redirected into a chunk via a 
preliminary distributing data structure and the local data structure is used to solve it. Thus, 
the space occupancy is the distributing structure (once) plus all chunks. Our approach is 
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orthogonal, and it guarantees better control of the parameter of subproblems we instantiate 
with respect to many previous approaches. 

Let X {\X\ = n) be the integer sequence of values drawn from [m] and let q G [m] 
be a generic rank query. Our goal is to produce a simple function / : [m] [m/n] and a 
machinery that generates a sequence X from X of length n coming from the universe [m/n], 
so that given the predecessor of q = f{q) in X, we can recover the predecessor of g in X. 
By this way, we can reduce recursively, multiple times, the rank problem while keeping a 
single sequence per step, instead of having one data structure per chunk. 

Easily enough, / is the "cutting" operation of the upper logn bits operated by the 
Elias Fano construction, which generates p different superblocks. Let X\, . . . ,Xp the sets 
of lower log(m/?7-) bits of values in X, one per superblock. We define our X as X = 
Ui<i<pXl, that is, the set of unique values we can extract from the X's. Suppose we have 
an oracle function -0, so that given a value x £ X and an index j € \p], ip{j,x) is the 
predecessor of x in Xj. We also recall from Section 2 that the upper bit vector H of the 
Elias Fano construction over X can answer the query ranki(x/2r^°^"^) in constant time (by 
performing select q{H,x/ 2^^°^'^^). That is, it can give the rank value at the beginning of 
each superblock. 

Given a query q we can perform ranki(g) in the following way: we use H to reduce 
the problem within the superblock and know the rank at the beginning of the superblock j. 
We then have the lower bits of our query {f{q)) and the sequence X: we rank f{q) there, 
obtaining a certain result, say v; we finally refer to our oracle to find the predecessor of v into 
Xj, and thus find the real answer for ranki(g). The main justification of this architecture 
is the following: in any superblock, the predecessor of some value can exhibit only certain 
values in its lower bits (those in X), thus once given the predecessor of f{q) our necessary 
step is only to reduce the problem within \\X\\ as the lower bits for any superblock are a 
subset of X. The impact of such choice is, as explained later, to let us implement the above 
oracle in just 0{'n}'^^) bits, for any < < 1. That is, by using a superlinear number of 
bits in n, we will be able to let m drop polynomially both in n and m. 

The above construction, thus, requires one to write X in an Elias Fano dictionary, 
plus the oracle space and the space to solve the predecessor problem on X. The first part 
accounts for B{n,m) + 0(n) bits, to which we add 0{n^^^) bits for the oracle. By carefully 
employing the String B-tree we can shrink the number of elements of X to 0(n/ log^n) 
elements, leaving us with the problem of ranking on a sequence of such length and universe 
[m/n]. We solve the problem by replicating the entire schema from the beginning. Up to 
the final stage of recursion, the series representing the space occupancy gives approximately 
0((n log (m/n) )/ log^* n + {n/ log^^ n)^~^^) bits at the i-th step, descending geometrically. 
Each step can be traversed in constant time during a query, so the overall time is constant 
again. More interestingly, at each step we reduce the universe size of the outcoming sequence 
to mn~\ Thus, at the final step s, we employ the previous result of Corollary 13. 21 and obtain 
a final redundancy of 0(m^n^~*^). 

4.2. Multiranking 

We now give further details on our construction. Mainly, we show that using our choice 
on how to build X and the function /, being able to rank over X we can build the oracle in 
0{n^^^) bits. We do it by illustrating, in a broader framework, the multiranking problem. 
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We are given a universe [u] (in our dictionary case, we start by setting u = m), and a 
set of nonempty sequences Ai, . . . ,Ac each containing a sorted subset of [u]. We also define 
r = Yli<j<n the global number of elements. The goal is, given two values 1 < i < c 

(the wanted superblock s) and 1 < q < u (the query f{q)), perform raiiki(q') in the set Ai 
(in our case, the head in s that is predecessor of the searched key) in 0(1) time and small 
space. 

A trivial solution to this problem would essentially build a fid for each of the sequences, 
thus spending a space proportional to 0{cu), which is prohibitive. Instead, we can carefully 
exploit the global nature of this task and solve it in less space. The core of this technique 
is the universe scaling procedure. We perform the union of all the A sequences and extract 
a new, single sequence A containing only the distinct values that appear in the union (that 
is, we kill duplicates). A is named the alphabet for our problem and we denote its length 
with t < r. Next, we rewrite all sequences by using rank of their elements in the alphabet 
instead of the initial arguments: now each sequence is defined on [t]. 

The multiranking problem is solved in two phases. We first perform ranking of the 
query g on A and then we exploit the information to recover the predecessor in the given 
set. Here we achieve our goal to (i) decouple a phase that depends on the universe from 
one that dependes on the elements and (ii) have only one version of the problem standing 
on the initial universe. The following lemma solves the multiranking problem completely, 
that is, outside our original distinction between a oracle and the alphabet ranking: 

Lemma 4.1. There exists a data structure solving the multirank problem over c nonempty 
increasing sequences A = {Ai, . . . , Ac} with elements drawn from the universe [u], having r 
elements in total using B{r,u) + 0{r^^^) + o{u) bits for any given < 6 < 1/2. 

Proof. Let A be the alphabet defined over u by the sequences in A, and let t = |A|. For 
each of the sequences in A we create a bitvector /3j of length t where the Pij = 1 if Aj £ Ai . 
We first view /3jS as rows of a matrix of size tc; since t < r and each of the sequences 
are non-empty (and hence r > c), the matrix is of size O(r^). We linearize the matrix 
by concatenating its rows and obtain a new bitvector /?' on which we want to perform 
predecessor search. We note that the universe size of this bitvector is O(r^), that is, the 
universe is polynomial. We store P' using the data structure of Corollary 13.21 setting the 
time to log (1/(5), so that space turns out to be 0{r^~^^). Finally, we store we store a fid 
occupying B{r,u) + o{u) that represents the subset A of the universe [u]. 

Solving the multirank is easy now: given a query q and a set index i, we use the o{u) 
fid and find A = ranki(g) in U, which leads to the predecessor into the alphabet A of our 
query q. Since A G [t] we can now use the P FID to find p = raiiki(ti + A). The final answer 
is clearly p — raiiki(ti). ■ 

4.3. Completing the puzzle 

The multiranking problem is closely connected with the Elias-Fano representation of 
Section [2j When plugged in our framework, as explained in Section 14.11 that we can use 
our data structure itself to implement the ranking procedure. Similarly we can use it for 
selectQ by employing another set of data. 

We are left with just one major detail. Each time we produce the output sequence X, 
containing the lower bits for all elements, our only clue for the number of elements is the 
worst case upper bound n, which is unacceptable. We now review the whole construction 
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and employ the string B-tree to have a poly logarithmic reduction on the number of elements, 
paying O(nloglogm) bits per recursion step. Generally, at each step we receive a sequence 
Xi as input and must output a new sequence Xj+i plus some data structures that can link 
the predecessor problem for Xi to Xi^i. Each Xi is stored in an Elias-Fano dictionary, and 
the sets of superblocks and lower bits sequences are built as explained before. We then apply 
a further reduction step on the problem cardinality. Each superblock can be either slim or 
fat depending on whether it contains less than log^ n elements or not. Each superblock is 
split into blocks of size log^ n, apart from the last block, and for each block we store a String 
B-tree with fan-out \/log n. Since the block is polylogarithmic in size, by means of shared 
precomputed tables we can perform predecessor search in constant time. Slim superblocks 
are handled directly by the tree and they do not participate further in the construction. For 
each block in a fat superblock, we logically extract its head, that is, the smallest element 
in it. We now use heads in the multiranking problems and we build the output sequence 
using only heads lower bits. As there can only be at most 0(n/log^n) blocks in fat 
superblocks, the size of the output sequence is at most 0(n/log^n). The oracle is built as 
usual, on the heads, using 0(n^+'^) bits. 

Ranking now performs the following steps: for each recursive step, it uses the Elias- 
Fano H vector to move into a superblock and at the same time check if it is slim or fat. In 
the latter case, it first outsources the query for the lower bits to the next dictionary, then 
feeds the answer to the multiranking instance and returns the actual answer. Thus, we just 
proved the following (with v = log^ n and w = n): 

Theorem 4.2. Let w and v be two integer parameters and letO < 6 < 1/2 be a real constant. 
Given Xi,ni > v and rrii > w, where Ui < rrii, there exists a procedure that produces a data 
structure involved in predecessor search. The data structure occupies B{ni,mi) + 0{w + 
nj log log mi + nj^^) space, and in 0{6~^) time, it reduces a predecessor query on Xi to a 
predecessor query on a new sequence Xj+i of length rij+i = 0{ni/v) over a universe [mj+i], 
where mj+i = rrii/w. 

We must then deal with the last two steps. The first step aims at supporting selectp 
since the above data structure can only support ranki. The second step deals with how 
treat the final sequence after a number of iteration steps have been executed. We can finally 
give the proof of our main result: 

Proof of Theorem \3.1\ Let X C [m] be the set whose characteristic vector is S. The data 
structure involves recursive instances of Theorem 14.21 by starting with Xq = X and using 
each step's output as input for the next step. As previously mentioned, we must only 
cover the base case and the last recursive step. We begin by describing the whole data 
structure, moving to algorithms later on. We start by partitioning X into X' and X" as 
described in the proof of Theorem 12. 2^ so that the construction is operated on both X' and 
X" . We now describe representation of X'] X" is stored in a similar way. We recursively 
build smaller sequences by invoking Theorem 14.21 exactly s times, using 5 as given, and 
parameters w = n, v = log^m. By invoking Corollary 13.21 the space bound easily follows. 
To support selectQ on the original sequence, we operate on the X' sequence alone, since 
when transformed to its dual Y', we obtain a strictly monotone sequence. Interpreting X' 
as an implicit representation of Y' we build a multiset representation for the high bits {H'), 
a new set of succinct string B-trees using the superblocks of the dual sequence and thought 
of as operating on Y' (similarly to Theorem 12.21) and a new set of s recursive applications 
of Theorem 14.21 
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select ]^ is trivial, thanks to the machinery of Theorem 12.21 The ranki algorithm for 
a query q is performed on both X' and X" FID: we start by querying Hq, the upper bits 
of Fq (Fq respectively) for q/2^^°^'^\ thus identifying a certain superblock in which the 
predecessor for q can appear. Unless the superblock is slim (refer to proof of Theorem 14. 2 p 
we must continue to search through the next lower-order bits. This is done via multiranking, 
which recurses in a cascading manner with the same technique on the s steps up to the last 
FID, that returns the answer. The chain is then walked backwards to find the root fid 
representative. We finally proceed through the succinct string B-tree to find the head 
and the next succinct string B-tree until we find the predecessor of q. The last step for 
recursion takes 0{e~^) time. All the middle steps for multiranking and succinct string B- 
tree traversals take 0{s6^^ + s) time. To support selectg, we act on X' , using exactly the 
same algorithm as before using, but with the collection of data structures built for the dual 
representation Y' , and following the steps of Theorem 12. 2[ 

During the buildup of the recursive process, say being at step i, the size n'- for sequence 
Xl {i > 1), is upper bounded by n/log^*m, while the universe has size m/n^. If at any 
step 2 < j < s the condition mj < w = n does not apply, we cannot apply Theorem 14.21 
so we truncate recursion and use a o{w) fid to store the sequence Xj. This contributes a 
negligible amount to the redundancy. We name the fid for each step Fi up to Fg. Suppose 
we can recurse for s steps with Theorem 14.21 we end up with a sequence over a universe 
nis = mjrf . By using Corollary 13.21 the space bound is no less than 0{n{m/n^y). The 
B{ni,mi) + 0{nl^^) factors decrease geometrically, so the root dominates and we can show 
that, apart from lower order terms, the space bound is as claimed. Otherwise, the total 
space s{ni,mi) of the recursive data structure satisfies: 

s{ni,mi) = s(nj_(.i, mj+i) + space(FiD for high bits) + space(string B-trees) + 0{nl~^^) 

where rij+i = rij/log^m and mj+i = nii/n. The claimed redundancy follows easily. ■ 
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